SUBSTITUTE SPECIFICATION 

Desaturase Enzymes 



CLEAN COPY 



Field of the Invention 

[0001] The invention relates to transgenic cells transformed with nucleic acid 

molecules which encode enzymes with desaturase activity and the use of these cells and 
enzymes in biocatalysis. 

Background of the Invention 

[0002] Desaturases are enzymes involved in the synthesis of long chain 

polyunsaturated fatty acids (PUFAs). PUFAs are fatty acids (FAs) which are essential 
to the normal functioning of a cell and their nutritional properties are well known. An 
example of a PUFA is docosahexanoic acid (DHA). DHA is a n-3 fatty acid that can be 
obtained directly from the diet or derived from metabolism of dietary linoleic and a- 
linolenic acid. The n-3 fatty acids are associated with health promoting properties. For 
example n-3 fatty acids have been described as anti-inflammatory, antithrombotic, 
antiarrhythmic, hypolipidemic and vasodilatory. As such, the role of DHA in the 
prevention and/or treatment of diseases such as coronary heart disease, hypertension, 
type II diabetes, ocular diseases, arthritis, cystic fibrosis and schizophrenia has been the 
focus of a great deal of medical research. 

[0003] The production of PUFAs involves a consecutive series of desaturations 

and elongations of the fatty acyl chain to generate arachidonic acid (20:4A5,8, 11,14) and 
docosahexaenoic acid (22:6A4,7,10,13,16,19). Several desaturases involved in this 
metabolic process have been isolated from marine microalgae, including Phaeodactylum 
tricornutum [5], Euglena gracilis [6] and Pavlova lutheri [7]. These membrane-bound 
desaturases are specific with respect to both chain length of the substrate and the double 
bond positions on the fatty acid. They belong to the class known as front-end fatty acid 
desaturases due to the fact that they introduce double bonds between the carboxy-group 
and pre-existing bond(s) of the fatty acid [1]. These desaturases contain a cytochrome 
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h5 domain at their N-terminus and three histidine motifs that are important for catalytic 
activity [10]. 



[0004] Desaturase enzymes and the genes which encode them are known in the 

art. For example, WO03/064596 describes, amongst other things, transgenic cells 
transformed with omega 3 and delta 12 desaturase nucleic acid molecules and the use of 
these cells in the production of fatty acids. In particular the use of the omega 3 
desaturase in the conversion of arachidonic acid to eicosapentaenoic acid and the use of 
the delta 12 desaturase in the conversion of oleic acid to linoleic acid. WO03/099216 
also describes fungal desaturases and in particular transgenic plants modified to express 
fungal delta 15 desaturase enzymes. 

[0005] Furthermore, US2003/0157144 and US2003/0 167525 disclose delta 5 

and delta 6 desaturase genes in the conversion of dihomoylinolenic acid to arachidonic 
acid and linoleic acid to y-linolenic acid respectively. Moreover, US2003/ 134400 
discloses delta 4 desaturase genes which are involved in the conversion of adrenic acid 
to co6- docosapentaenoic acid and in the conversion of co3- docosapentaenoic acid to 
docosahexaenoic acid. These rare fatty acids are used in pharmacutical and cosmetic 
compositions and can be essential nutritional fatty acids. 

[0006] Besides the common FAs 16:0, 16:1A9, 18:0 and 1 8: 1 A9 found in most 

living organisms, trace amounts of more unusual fatty acids can be found in a wide 
range of species. For instance, presence of 1 6: 1 A 1 1 has been reported in several species 
of Pavlova, in the Eustigmatophyte Nannochloropsis oculata, and in the diatoms 
Phaeodactylum tricornutum and Thalassiosira pseudonana [11,12,13]. This FA 
accounted for a very small portion of the total FAs in these microalgae, and its specific 
role in the algal cells is unknown. However, this FA is a very important precursor in the 
synthesis of sex pheromones in insects. Sex pheromones are species-specific blends of 
unsaturated fatty acid (UFA) derivatives that differ in terminal functional group and in 
the number, position and configuration (Z or E) of the double bond(s), which are 
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produced by various acyl-CoA desaturases [14,15]. Simple monoene Al 1 UFAs are the 
most prevalent precursors in the formation of major sex pheromome components in the 
modem Lepidoptera [16,17]. For instance, in the com earworm Helicoverpa zea, which 
produces a pheromone mixture of Zll-16:Ald and Z9-16:Ald in a 30:1 ratio, the most 
abundant desaturase-encoding transcript is HzeaLPAQ (also called HzPGDsl) which 
encodes a Al 1 -desaturase that does not possess a cytochrome b5 extension, and 
therefore requires free cytochrome b5 for activity. Many acyl-CoA Al 1 -desaturases with 
different specificities have been isolated from insects [14,15], but none from other 
species. 

Description of the Invention 

[0007] We describe the first characterisation of a cytochrome b5 desaturase 

exhibiting Al 1 -desaturase activity. 

[0008] According to an aspect of the invention there is provided a transgenic cell 

comprising a nucleic acid molecule which comprises a nucleic acid sequence which 
nucleic acid molecule consists of the sequences as represented in Figures 5a, 5b, 6a, 6c, 
7a, 8a, 8b, 9a, 10a, 11a, lib, lid or nucleic acid molecules which hybridise to these 
sequences, wherein said nucleic acid molecules encode a polypeptide which has 
desaturase activity. 

[0009] In a preferred embodiment of the invention said hybridisation conditions 

are stringent hybridisation conditions. 

[0010] In a preferred embodiment of the invention said nucleic acid molecule 

comprises a nucleic acid sequence which has at least 30% homology to the nucleic acid 
sequence represented in Figures 5a, 5b, 6a, 6c, 7a, 8a, 8b, 9a, 10a, 11a, lib, or lid. 
Preferably said homology is at least 40%, 50%, 60%, 70%, 80%, 90%>, or at least 99% 
identity with the nucleic acid sequence represented in Figures 5a, 5b, 6a, 7a, 8a, 8b, 9a, 
10a, 1 la, or 1 lb and which encodes a polypeptide which has desaturase activity. 
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[0011] The sequence of desaturase nucleic acids may be modified to produce 

variant enzymes with enhanced expression in cells. For example, the addition of a 
codon that encodes an alanine amino acid may facilitate recombinant expression in 
microbial systems e.g. yeast. These modifications may not be required in all expression 
systems but is sometimes desirable. 

[0012] In a preferred embodiment of the invention said nucleic acid molecule 

comprises the nucleic acid sequence as represented in Figures 5a, 5b, 6a, 6c 7a, 8a, 8b, 
9a, 10a, 1 la, 1 lb, or 1 Id. Preferably said nucleic acid molecule consists of the nucleic 
acid sequence as represented in Figures 5a, 5b, 6a, 7a, 8a, 8b, 9a, 10a, 1 la, or 1 lb. 

[0013] In a further preferred embodiment of the invention said cell over- 

expresses said desaturase encoded by said nucleic acid molecule. 

[0014] In a preferred embodiment of the invention said over-expression is at 

least 2-fold higher when compared to a non-transformed reference cell of the same 
species. 

[0015] Preferably said over-expression is: at least 3-fold, 4-fold, 5-fold, 6-fold, 

7-fold, 8-fold, 9-fold, or at least 10-fold when compared to a non-transformed reference 
cell of the same species. 

[0016] In a preferred embodiment of the invention said nucleic acid molecule is 

a cDNA. 

[0017] In yet a further preferred embodiment of the invention said nucleic acid 

molecule is a genomic DNA. 

[0018] In a preferred embodiment of the invention said transgenic cell is 

transfected with a nucleic acid molecule comprising a nucleic acid sequence as 
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represented by Figure 10a and which encodes a desaturase polypeptide wherein said 
polypeptide has Al 1 -desaturase activity, or a nucleic acid molecule which hybridises to 
the nucleic acid molecule in Figure 10a and encodes a polypeptide with A 1 1 -desaturase 
activity. 

[0019] In an alternative preferred embodiment of the invention said transgenic 

cell is transfected with a nucleic acid molecule comprising a nucleic acid sequence as 
represented by Figure 8a and which encodes a desaturase polypeptide wherein said 
polypeptide has A6~desaturase activity, or a nucleic acid molecule which hybridises to 
the nucleic acid molecule in Figure 8a and encodes a polypeptide with A6-desaturase 
activity. 

[0020] In a preferred embodiment of the invention said transgenic cell is a 

eukaryotic cell. 

[0021] In an alternative preferred embodiment of the invention said cell is a 

prokaryotic cell. 

[0022] In a further preferred embodiment of the invention said eukaryotic cell is 

a plant cell. 

[0023] Plants which include a plant cell according to the invention are also 

provided as are seeds produced by said plants. 

[0024] In a preferred embodiment of the invention said plant is selected from: 

corn (Zea mays), canola (Brassica napus, Brassica rapa ssp.), flax (Linum 
usitatissimum), alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secede cerale), 
sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annus), wheat 
(Tritium aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato 
(Solarium tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium hirsutum), sweet 
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potato (Iopmoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut 
(Cocas nucifera), pineapple (Anana comosus), citris tree (Citrus spp.) cocoa 
(Theobroma cacao), tea (Camellia senensis), banana (Musa spp.), avacado (Persea 
americana), fig (Ficus casica), guava (Psidium guajava), mango (Mangifer indica), 
olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), 
macadamia (Macadamia inter grif olid), almond (Prunus amygdalus), sugar beets (Beta 
vulgaris), oats, barley, vegetables and ornamentals. 

[0025] Preferably, plants of the present invention are crop plants (for example, 

cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, 
barley, pea), and other root, tuber or seed crops. Important seed crops are oil-seed rape, 
sugar beet, maize, sunflower, soybean, sorghum, and flax (linseed). Horticultural plants 
to which the present invention may be applied may include lettuce, endive, and 
vegetable brassicas including cabbage, broccoli, and cauliflower. The present invention 
may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, tomato, or pepper. 

[0026] Grain plants that provide seeds of interest include oil-seed plants and 

leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley, 
rice, sorghum, rye, etc. Oil seed plants include cotton, soybean, safflower, sunflower, 
Brassica, maize, alfalfa, palm, coconut, etc. Leguminous plants include beans and peas. 
Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, 
lima bean, fava been, lentils, chickpea, etc. 

[0027] According to a further aspect of the invention there is provided a seed 

comprising a plant cell according to the invention. Preferably said seed is from an oil 
seed plant. 

[0028] According to a yet further aspect of the invention there is provided a 

reaction vessel comprising at least one polypeptide according to the invention, fatty acid 
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substrates and co-factors wherein said vessel is adapted for the desaturation of said fatty 
acids substrates. 



[0029] In a preferred embodiment of the invention said polypeptide is expressed 

by a cell according to the invention. 

[0030] Preferably said cell is a eukaryotic cell, for example a yeast cell. 

[0031] In an alternative preferred embodiment of the invention said cell is a 

prokaryotic cell. 

[0032] According to a further aspect of the invention there is provided a method 

to desaturate a fatty acid substrate comprising the steps of: 

i) providing a reaction vessel according to the invention; and 

ii) growing said cells contained in said reaction vessel under conditions 
which allow the desaturation of at least one fatty acid substrate. 

[0033] An embodiment of the invention will now be described by example only 

and with reference to the following tables and figures: 

[0034] Table 1 illustrates the composition of major fatty acids in T. pseudonana; 

[0035] Table 2 illustrates the major fatty acids of pYES and pYDESN yeast 

transformants with and without addition of exogenous saturated fatty acids; 

[0036] Table 3 illustrates the A6 desaturase activity of TpDESI compared to that 

of an homologous Phaeodactylum tricornutum desaturase; 

[0037] Figure 1 illustrates the predicted protein sequences with homology to 

front-end desaturases derived from the T. pseudonana draft genome. Sequence 
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alignments of 12 putative 7. pseudonana desaturases with other functionally 
characterised front-end desaturase enzymes identified three main blocks of homology 
that represent the functional domains of front-end acyl desaturases (A). The darker 
shaded box highlights the cytochrome b5 haem-binding domain and shaded boxes 
indicate three histidine boxes. See Material and Methods for Genbank accession number 
and source species of the functionally characterised enzymes. A phylogenetic tree of 
nine T. pseudonana desaturases with other enzymes was constructed (B). By removing 
the regions containing gaps (ambiguous alignment region), a dataset was created from 
an alignment originally made with clustalX. The tree was constructed from the dataset 
using Phylip3.5c software package and bootstrap analyses were carried out with 1000 
replicates. Only well supported nodes (over 70%) are indicated with bootstrap values. 
All branches are drawn to scale as indicated by the scale bar (=0.1 substitutions/site). 
TpDESN sequence is 477 amino acids long (C). The cytochrome b5 haem -binding 
domain is on a shaded background and the three histidine-boxes are framed; 

[0038] Figure 2 illustrates RT-PCR expression analysis of TpdesN. Cells were 

harvested at different stages of growth for total RNA extraction and cDNA synthesis 
(A). PCR was performed on cDNA derived from reverse transcribed RNA using TpdesN 
and 18s rRNA specific primer pairs (B). PCR was carried out on undiluted (lane 1) and 
five-fold serial dilutions (lane 2-4) of each cDNA. The 18S rRNA gene was used as a 
control of cDNA synthesis. EE: early exponential phase, LE: late exponential phase, ES: 
early stationary phase; 

[0039] Figure 3 illustrates GC analysis of FAMEs (fatty acid methyl esters) from 

yeast transformed with the empty plasmid pYES2 or the plasmid containing TpDESN. 
Invscl yeast strain transformed with either pYES2 (A) or pYDESN (B) were induced 
for three days at 20°C without supplementation before sampling for fatty acid analysis. I. 
S. internal standard (17:0). The experiment was repeated three times and results of a 
representative experiment are shown; 
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[0040] Figure 4 illustrates mass spectra of DMDS FAME adducts from 

pYDESN transformed yeast. Mass spectrum of the DMDS adduct of 16:1 A9 FAME, 
present in all yeast samples (A). Mass spectrum of the DMDS adduct of 16:1 Al 1 
FAME, which was only found in yeast transformed with pYDESN (B). Picolinyl esters 
with spectra characteristic of 16:1 All were also identified in these samples (data not 
shown); and 

[0041] Figure 5a is the genomic nucleic acid sequence of the desaturase A (SEQ 

ID NO: 3) from T. pseudonana; Figure 5b is the cDNA sequence of desaturase A (SEQ 
ID NO: 1); Figure 5c amino acid sequence (SEQ ID NO: 2); 

[0042] Figure 6a (SEQ ID NO: 4) is the genomic nucleic acid sequence of 

desaturase B from Thalassiosira pseudonana; Figure 6b is the partial amino acid 
sequence (SEQ ID NO: 5); Figure 6c is the cDNA sequence of desaturase B (SEQ ID 
NO: 6); and Figure 6d is the amino acid sequence of said cDNA sequence (SEQ ID NO: 

7); 

[0043] Figure 7a is the nucleic acid sequence of desaturase E from Thalassiosira 

psuedonana (SEQ ID NO: 8); Figure 7b is the amino acid sequence (SEQ ID NO: 9); 

[0044] Figure 8a is the nucleic acid sequence of desaturase I from Thalassiosira 

pseudonana (SEQ ID NO: 10); Figure 8b is the cDNA sequence (SEQ ID NO: 11); and 
Figure 8c is the amino acid sequence (SEQ ID NO: 12); 

[0045] Figure 9a is the nucleic acid sequence of desaturase K from 

Thalassiosira pseudonana (SEQ ID NO: 13); Figure 9b is the amino acid sequence 
(SEQ ID NO: 14); 

[0046] Figure 10a is the nucleic acid sequence of desaturase N from 

Thalassiosira pseudonana (SEQ ID NO: 15); 



WKLI 142294 v2 
2902076-000001 



9 



SUBSTITUTE SPECIFICATION 



CLEAN COPY 



[0047] Figure 11a is the nucleic acid sequence of desaturase O from 

Thalassiosira pseudonana (SEQ ID NO: 16); Figure lib is the cDNA sequence (SEQ 
ID NO: 17); Figure 11c is the amino acid sequence (SEQ ID NO: 18); Figure 1 Id is the 
nucleic acid sequence of desaturase O variant sequence from Thalassiosira pseudonana 
(SEQ ID NO: 19); and Figure lie is the amino acid sequence of said variant desaturase 
O (SEQ ID NO: 20); 

[0048] Figure 12A and 12B is a GC analysis of FAMEs from yeast expressing 

TpDESI with exogenous substrates 18:2A9,12 (A) and 18:3A9,12,15 (B). New FAs 
produced from endogenous and exogenous substrates are underlined; Figure 12C is a 
GC analysis of FAMEs from yeast transformed with a vector only control compared to 
yeast transformed with TpDESI; 

[0049] Figure 13 is an illustration of fatty acid synthesis pathways; and 

[0050] Figure 14 is a GC analysis of FAMEs from yeast expressing TpDESO. 

Materials and Methods 

Identification of putative Thalassiosira pseudonana desaturase-coding sequences 
and phylogenetic analysis with other functionally characterised desaturases 

[0051] The draft genome of the diatom T. pseudonana has been sequenced to 

approximately nine times coverage by the whole genome shotgun method. Sequence 
data were produced by the US Department of Energy Joint Genome Institute 
( http : //www . i gi . doe . go v/ ) and the raw sequence data were downloaded and installed on 
a local server. Batch tblastn searches were carried out using protein sequences of the 
following 13 known desaturases as query, including P1DES1 (AY332747, Pavlova 
lutheriX TFAD4 (AF489589, Thraustochytrium sp. ATCC 21685), TFAD5 (AF489588, 
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Thraustochytrium sp. ATCC 21685), PtDELS (AY082392, Phaeodactylum 
tricornutum), PtDEL6 (AY082393, Phaeodactylum tricornutum), EgDEL8 (AF139720, 
Euglena gracilis), EgDEL4 (AY278558, Euglena gracilis), ZfDEL (AF309556, Danio 
rerio), B0DEL6 (U79010, Borage officinalis), HsDELS (AF084558, Homo sapiens), 
HsDEL6 (AF084559, Homo sapiens), CeDEL6 (AF031477, Caenorhahditis elegans) 
and CeDEL5 (AF078796, Caenorhahditis elegans). 

[0052] All non-redundant sequences with an E value less than 0.001 were 

retrieved and assembled into contigs using the CAP3 sequence assembly program [18]. 
The contigs were translated into amino sequences in three frames in the orientation 
indicated by tblastn result. Putative desaturase gene models were constructed manually 
based on sequence homology and in frame GT-AG intron boundaries were identified. 

[0053] Deduced amino acid sequences of all 12 putative desaturase sequences of 

T. pseudonana were aligned with the above 13 functionally characterised desaturases 
from other species, using ClustalX version 1.8 [19]. The alignment was then reconciled 
and further adjusted. Only nine near full-length Thalassiosira sequences were retained 
for further analyses. 

[0054] A dataset of 250 conserved residue positions was used for construction of 

the phylogenetic tree. Distance analysis used the program protdist of the Phylip 3.5c 
package with a PAM250 substitution matrix and a tree was then built from the matrix 
using fitch (Fitch-Margoliash method). Bootstrap analyses were carried out with 1000 
replicates using the neighbour-joining algorithm. 

Cultivation of T. pseudonana 

[0055] T. pseudonana (CCAP 1085/12) was obtained from the Culture 

Collection of Algae and Protozoa (Dunstaffnage Marine Lab., Oban, PA34 4 AD, 
Scotland, U.K.). The growth medium used was enriched artificial seawater medium 
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(EASW), made up in 20 1 batches as described previously [4]. The cultures were grown 
in one litre flasks at 15°C with 50 jaE nT 2 s" 1 constant illumination, and aeration 
provided by shaking the flasks at 150 rpm. 

[0056] Cell density was monitored by counting cells with a haemocytometer. 

Nitrate concentration was determined periodically during the culture time by measuring 
the change of the medium absorbance at 220 nm [20]. 

RNA extraction, cDNA synthesis and RT-PCR analysis 

[0057] Total RNA was extracted from frozen cells harvested at different stages 

of growth with an RNeasy plant mini kit (Qiagen). First strand cDNA was synthesised 
from three fig of DNAse treated RNA using a Prostar First-strand RT-PCR kit 
(Stratagene). PCR was performed using undiluted and five-fold dilutions of cDNAs as 
followed: the reactions were heated to 95 °C for 5 min followed by 35 cycles at 95 °C 
for 30 s, 50 °C or 65°C (for J8S rRNA and TpdesN respectively) for 30 s and 72 °C for 2 
min, then a single 72 °C for 10 min. As a marker for constitutive expression, the 18S 
rRNA gene was amplified with the primer TH18S5' (5 ' -GGTA ACG AATTGTTAG-3 ' ) 
(SEQ ID NO: 21) and TH18S3' (5'-GTCGGCATAGTTTATG-3') (SEQ ID NO: 22). 
TpdesN cDNA was amplified using primers DESNR2 (5'- 
GTGAGAGCACTAACCAAGCTT-3') (SEQ ID NO: 23) and DESN2 (5'- 
CAATCAGTAGGCTTCGTC G-3') (SEQ ID NO: 24). Aliquots of PCR reaction were 
electrophoresed through a 1% agarose gel. Identity of the diagnostic fragment amplified 
with TpdesN specific primers was verified by sequencing after cloning in the pGEM-T 
Easy Vector (Promega). 

Functional characterisation of TpDESI in yeast 

[0058] The entire TpdesI coding region was amplified from T. pseudonana 

cDNA with primers DesINB 5'- 

GCG GGATCC ACCATGGCJXjGAAAAGGAGGAGAC^ (SEQ ID NO: 25) (ORF 

12 
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start codon is indicated by bold type; underlined sequence is a BamHl site; italic 
sequence is an added alanine codon, not present in the original sequence of Pldesl) and 
DesICE 5 ^GC GAATTC TTACATGGC AGGGAAATC-3 ' (SEQ ID NO: 26) (ORE stop 
codon is indicated in bold type; underlined sequence is a EcoRl site). The Expand High 
Fidelity PCR system (Roche) was employed to minimise potential PGR errors. The 
amplified product was gel purified, restricted and cloned into the corresponding sites 
behind the galactose-inducible GAL1 promoter of pYES2 (Invitrogen) to yield the 
plasmid pYDESl. This vector was transformed into S. cerevisiae strain Invscl 
(Invitrogen) by a lithium acetate method, and transformants were selected on minimal 
medium plates lacking uracil. 

[0059] For functional expression, cultures were grown at 25°C in the presence 

of 2% (w/v) raffmose and 1% (w/v) Tergitol NP-40 (Sigma). Expression of the 
transgene was induced when OD 6 oonm reached 0.2-0.3 by supplementing galactose to 2% 
(w/v). At that time, the appropriate fatty acids were added to a final concentration of 50 
fiM. Incubation was carried out at 25°C for three days. 

Functional characterisation of TpDESN in yeast 

[0060] Genomic DNA from T. pseudonana cells was extracted using the DNA 

isolation kit Puregene (Gentra Systems) and 100 ng was used to amplify the entire 
TpdesN coding region with primers DesNNB 5'- 

GCGGGATCCACC ATGGCrG ACTTTCTCTCCGGC-3 ' (SEQ ID NO: 27) (ORF start 
codon is indicated by bold type; underlined sequence is a BamHl site; italic sequence is 
an added alanine codon, not present in the original sequence of TpdesN) and DesNCE 
5 '-GCGAATTCTCAATCAGTAGGCTTCGT-3 ' (SEQ ID NO: 28) (ORF stop codon is 
indicated in bold type; underlined sequence is a EcoRl site). The Expand High Fidelity 
PCR system (Roche) was employed to minimise potential PCR errors. The amplified 
product was gel purified, restricted with EcoRl and BamHl and cloned into the 
corresponding sites behind the galactose-inducible GALl promoter of pYES2 
(Invitrogen) to yield the plasmid pYDESN. The fidelity of the cloned PCR product was 

13 
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checked by sequencing. The vector pYDESN was then transformed into S. cerevisiae 
strain Invscl (Invitrogen) by a lithium acetate method, and transformants were selected 
on minimal medium plates lacking uracil. 

[0061] For the feeding experiment with PUFAs, cultures were grown at 22°C in 

the presence of 2% (w/v) raffmose and 1% (w/v) Tergitol NP-40 (Sigma). Expression of 
the transgene was induced when OD 6 oonm reached 0.2-0.3 by supplementing galactose to 
2% (w/v). At that time, the appropriate fatty acids were added to a final concentration of 
50 /xM. Incubation was carried out at 22°C for three days and then 15°C for another 
three days. For the feeding experiment with saturated fatty acids, a single Invscl 
colony transformed with pYES2 (empty plasmid, control) or pYDESN was inoculated 
in 10 ml of minimal media minus uracil containing 2% raffmose and grown overnight at 
30°C with shaking (300 rpm). After 16-24 h, cells were collected by spinning at 4500 
lpm for 10 min. After discarding the supernatant, the cell pellet was resuspended in the 
same medium mentioned above supplemented with 2% galactose and 1% tergitol, to 
obtain a cell density of 5xl0 7 cells/ml. Fifteen ml of this cell suspension were added to a 
100 ml-flask with or without addition of saturated fatty acids (as mentioned in the text) 
at 500 jiM final concentration. Desaturase induction was then carried out at 20°C with 
shaking (300 rpm) for three days. 

Fatty acid analysis 

[0062] Microalgae or yeast cells were harvested by centrifugation. Total fatty 

acids were extracted and transmethylated as previously described [4]. Most FAMEs 
were identified by comparison of retention times to a 37 FAME mix (Supelco). PUFA 
FAMEs were also identified by comparison to a sample of standard Menhaden oil 
(Supelco) transmethylated as per the samples. 

[0063] Dimethyl disulphide (DMDS) adducts were used to determine the double 

bond position in identified and unidentified monounsaturated FAMEs. These were 
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made by adding together 50 jliI DMDS (Aldrich), 100-1000 ng FAMEs dissolved in 50 
jitl hexane, and 5 jliI 50 mg ml" 1 iodine in diethyl ether. This solution was heated at 40°C 
for 15 h and partitioned with 200 )il hexane and 100 fil 5% (w/v) sodium thiosulphate. 
The hexane phase was removed, dried under vacuum, reconstituted in 50 (il fresh 
hexane and used for GC-MS analysis. A Trace GC 2000 (ThermoQuest) fitted with a 
30 m x 0.25 mm x 0.5 jim film thickness ZB-1 column (Phenomenex) was used to 
chromatograph 2 jil DMDS adducts injected at 250°C and a 50:1 split ratio with Fie as 
carrier gas at 0.6 ml min" 1 in constant flow mode. The oven program was 120°C for 
lmin then to 340°C at 5°C min" 1 . Mass spectra were obtained using a GCQ 
(ThermoQuest) mass spectrometer operating in full scan mode over 50-500 m/z. 
Picolinyl esters were also made from FAMEs to confirm their identities. These were 
obtained by adding 15 jllI freshly prepared 2:1 (v/v) 3-(hydroxymethyl)-pyridine 
(Aldrich): potassium tert butoxide 1 M solution in tetrahydrofuran (Aldrich) to 50 jiil 
FAMEs dissolved in dichloromethane. This solution was heated at 40°C for 30 min and 
partitioned with 200 |xl hexane and 100 \x\ 2.5% (w/v) sodium hydrogen carbonate. The 
hexane phase was removed, dried under vacuum and reconstituted in 50 \il fresh 
hexane. Picolinyl esters were injected and separated by GC-MS using the same 
conditions as for DMDS adducts; Sperling P., Zahringer U. and Heinz E. (1998) A 
sphingolipid desaturase from higher plants. J. Biol. Chem. 273, 28590-28596; Sperling 
P., Libisch B., Zahringer U., Napier J. A. and Heinz E. (2001) Functional identification 
of a D8-sphingolipid desaturase from Borago officinalis. Arch. Biochem. Biophys. 388, 
293-298; Whitney H.M., Michaelson, L.V., Sayanova, O., Pickett J.A. and Napier, J.A. 
(2003) Functional characterization of two two cytochrome b5-fusioii desaturases from 
Anemone leveillei: The unexpected identification of a fatty acid A6-desaturase. Planta 
217, 983-992; each of which are incorporated by reference. 
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EXAMPLES 
EXAMPLE 1 

Identification and phylogenetic analysis of putative T. pseudonana desaturase 
sequences with other functionally characterised desaturases 

[0064] Tblastn searches with 13 functionally characterised desaturases revealed 

427 non-redundant raw sequences with E values less than 0.001. Twelve unique contigs 
were assembled after retrieving these sequences and gene models were constructed 
manually based on sequence homology. These 12 gene contigs were arbitrarily 
designated TpdesA to TpdesL. All 12 showed significant sequence similarity to query 
sequences with 9 containing near full length open reading frames compared to other 
known desaturases (Fig. 1A). Interestingly, the predicted amino acid sequence of all 
nine T. pseudonana desaturases have a characteristic fused cytochrome b5 haem-binding 
domain (HP[G/A]G) at their N-terminus and three histidine boxes (H[X]3-4H, H[X]2- 
3HH AND Q[X]2-3HH) with the replacement of the first histidine by glutamine in the 
third histidine box in all but two of the predicted proteins (TpDESA and TpDESB). 
These are common characteristics of a large subgroup of front-end acyl group 
desaturases [21]. These histidine-box motifs are critical for desaturase activity, most 
likely because they serve to coordinate the diiron-oxo component of the active site. 
Three remaining sequences (TpDESD, TpDESL and TpDESH) appear to be partial, 
covering only the C-terminal end of desaturases, but nevertheless they do contain a 
typical third histidine box of the above mentioned subgroup of desaturases (Fig. 1 A). 

[0065] In order to gain insight into the relationships of these T. pseudonana 

sequences to other functionally characterised desaturases and especially algal 
desaturases, we constructed an unrooted phylogenetic tree using a Fitch-Margoliash 
method with statistical confidence measured by bootstrap analysis (Fig. IB). 
Relationships of four putative T. pseudonana desaturases are in well supported (>70% 
bootstrap value) subgroups with at least one functionally characterised desaturase from 
other species. Both TpDESM and TpDESO grouped with PtDEL5, a A5-desaturase 
from another diatom, P. tricornutum [5], suggesting these two enzymes may also have a 
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A5-desaturase activity. Similarly TpDESK is grouped with two A4~desaturases TFAD4 
and EgDEL4 from Thraustochytrium sp. ATCC21685 [22] and E. gracilis respectively. 
TpDESI grouped with PtDEL6, a A6-desaturase from P. tricornutum. This indicates that 
TpDESK and TpDESI may have A4 and A6-desaturase activities respectively. However, 
as enzymes with different regioselectivities are also found in a well supported subgroup 
(EgDEL8, CeDELS and CeDEL6; A8, A5 and A6-desaturase respectively) and 
regioselectivity may even derive independently after a more recent duplication (CeDEL5 
and CeDEL6) [23] predictions based on homology can be misleading and it is essential 
to functionally characterise each enzyme. 

[0066] The remaining five T pseudonana sequences fall into three separate 

subgroups (TpDESE; TpDESA and TpDESB; TpDESG and TpDESN) which do not 
group with any other known functional desaturases with high confidence. It is therefore 
possible that these proteins exhibit novel regioselectivity. The current study focussed on 
the characterisation of one of these proteins, TpDESN. 

EXAMPLE 2 

Temporal expression of TpDESN gene 

[0067] RT-PCR analysis of TpdesN transcript was conducted at different stages 

of algal growth in order to establish if and when this gene is expressed. After RNA 
extraction and cDNA synthesis, TpdesN specific PCR products were amplified. PCR 
amplification of the 18S rDNA gene was performed as a control for the quantity of 
cDNA used during PCR reactions. Figure 2 shows that the diagnostic 519 bp cDNA 
amplification product expected for TpdesN was present at similar level at the different 
stages of cultivation of the microalga cells. Thus, TpdesN is transcriptionally active at a 
constitutive level during Thalassiosira growth, suggesting that it may encode a 
desaturase with a housekeeping function. 
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EXAMPLE 3 

Functional characterisation of TpDESN in yeast 

[0068] The putative desaturase sequence annotated TpdesN was contained on a 

genomic DNA contig of 2580 bp on which no introns was detected. To establish the 
function of the protein encoded by this gene, the full-length sequence was amplified 
from genomic DNA. An alanine codon containing a G as the first letter was added 
immediately downstream of the start codon of TpdesN to ensure optimal translation in 
yeast [24]. The TpdesN ORF is 1434 bp long, and encodes a 477 amino acid protein 
TpDESN (Fig. 1C), having a molecular weight of 53.8 kDa. Analysis of the secondary 
structure of TpDESN using SOSUI software 

(http://sosui.proteome.bio.tuat.ac.jp/sosuiframeO.html) [25] predicted four 
transmembrane regions (not shown). Alignment of TpDESN with functionally 
characterised desaturase sequences mentioned above indicated an overall identity of 
25%, with the cytochrome M-like domain and the three conserved histidine-rich motif 
areas showing greatest homology. 

[0069] The primary sequence of TpDESN exhibited typical features of front-end 

desaturases involved in PUFA synthesis. In order to characterise the specificity of this 
protein, PUFAs (18:2A9,12; 20:2A11,14; 20:3A8,1 1,14; 22:4A5,8,1 1,14; 18:3A9,12,15; 
20:3A1 1,14,17; 20:4A8,1 1,14,17; 22:5A7,10,13, 16,19) where first fed to the host yeast 
transformed with pYDESN and the vector alone (pYES2) as a control. Unexpectedly, 
after six days of incubation, TpDESN did not desaturate any of the supplemented PUFA 
substrate. Furthermore, there did not appear to be any production of 18:2A9,12 from 
endogenous 18:1A9. However, a significant increase was observed for a peak eluting in 
the range of sixteen carbon monounsaturated FAMEs in the yeast transformed with 
pYDESN (Fig. 3). The position of the double bond in this product was determined by 
GC-MS analysis of FAMEs derived to DMDS adducts [26] and picolinyl esters. The 
DMDS adduct of 1 6: 1 A9 FAME yielded two major fragments at m/z 145 and 217 (Fig. 
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4A). Fragmentation of the increased FAME peak found in unfed or fed yeast 
transformed with pYDESN produced two diagnostic fragments at m/z 117 and 245 (Fig. 
4B). This fragmentation pattern was indicative of an All monounsaturated sixteen 
carbon FAME, 16:1 All, suggesting that TpDESN encoded a new Al 1-desaturase. 
Small amounts of this FA have also been measured in Thalassiosira cells (Table 1). To 
further substantiate these results, yeast transformed with pYDESN and the control 
empty vector, pYES2, were cultivated in medium supplemented with saturated FA 
(14:0; 16:0; 18:0) representing potential substrates for the synthesis of the 
monounsaturated product. Yeast fatty acid profiles were analysed after three days of 
incubation at 20°C. Results in Table 2 showed that a small amount of 1 6: 1 A 1 1 (0.23% 
of total FAs) was detected in yeast transformed with pYES2, suggesting endogenous 
synthesis of this FA from 16:0. This FA accumulated at a higher level in both types of 
transformed yeast after feeding with 14:0, with values up to 5.84% in pYDESN 
transformants. A possible explanation for this increase in the pYES2 transformants is 
that the endogenous yeast A9-desaturase was able to use additional 14:0 to produce 
14:1 A9 that was subsequently elongated to 16:1 Al 1. Moreover, it has been reported that 
wild type yeast cells cultivated in media supplemented with 14:1 A9 synthesised 16:1 
Al 1 by Elolp-dependent carboxy terminal elongation [27]. After 18:0 supplementation, 
the percentage of 16:1 All, of about 6% total FAs, was similar to that observed after 
feeding with 16:0. Presence of extra 18:0 could lead to an inhibition of the 16:0 chain 
elongation system, which might allow more 16:0 to be available for Al 1 -desaturation. 
On the other hand, 18:1 All represents 1.2% of the total FAs in transgenic yeast. No 
variation in its proportion was monitored under the different conditions of incubation, 
even after supplementation with 18:0 in pYDESN transformants. This suggests that this 
FA originates from elongation of 16: 1 A9 rather than Al 1 -desaturation of 18:0. 

EXAMPLE 4 

Functional characterisation of TpDESI in yeast 

[0070] To establish the function of TpDESI, the full-length cDNA was 

expressed in the yeast Invscl under the control of an inducible galactose promoter. 
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Potential substrates of front-end desaturases (18:2A9,12; 18:3A9,12,15; 20:3A8,1 1,14; 
20:4A8,1 1,14,17; 22:4A7,10,13,16; 22:5A7,10,13, 16,19) were tested. Figure 12A and 
12B show that after supplementation of the medium with 18:2A 9,12 and 18:3A 9,12,15 
respectively, and after three days of incubation, yeast cells containing pYDESI had extra 
fatty acids. Extra peaks observed when cells were fed with 18:2A9,12 had a retention 
time identical to 16:2A6,9, 18:2A6,12 and 18:3A6,9,12 (Figure 12A). Extra peaks 
observed when cells were fed with 18:3A9,12,15 had a retention time identical to 
16:2A6,9, 18:2A6,12 and 18:4A6,9,12,15 (Figure 12B). These results demonstrate that 
TpdesI encodes a A6-desaturase which can introduce double bond in exogenously fed 
18:2A9,12and 18:3A9,12,15 fatty acids, but also in endogenous 16:1A9 and 18:1 A9 fatty 
acids. Percentages of conversion of these different substrates are given in Table 3. 

[0071] Fatty acid profiling of marine microalgae had shown that T. pseudonana 

represents a good candidate to discover genes involved in the production and storage of 
PUFAs [4]. Analysis of the recently completed draft genome of this microalga revealed 
the presence of many candidate genes for elongase and desaturase activities most 
probably involved in catalysing different steps of the PUFA biosynthetic process. We 
have identified 12 possible desaturase genes, 9 of which there is sufficient sequence 
information to demonstrate that they exhibit typical features of front-end desaturases, 
i.e. a cytochrome b5 domain in the N-terminus and three histidine clusters located at 
highly conserved regions. Phylogenetic analysis revealed that several of the genes are 
closely related to a number of previously characterised front-end desaturases involved in 
PUFA synthesis. However, the current work highlights the fact that desaturase function, 
in terms of regioselectivity, cannot solely be based on prediction from primary amino 
acid sequence homology. 

[0072] The fatty acid profile of T. pseudonana cells is quite diverse (Table 1), 

with the health beneficial EPA (20:5 A5, 8,1 1,14,17) and DHA (22:6A4,7,10,13,16,19) 
accounting for a large proportion. Flowever, the number of desaturase gene sequences 
found in the genome was higher than we expected based on the number of different 
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desaturation reactions required to produce the diversity of FA in this microalga. This 
suggested that non-obvious desaturation reactions might also occur in the Thalassiossira 
cells. As a first step to establishing function of the many putative desaturase sequences, 
we focused on the TpdesN contig due to the fact that the sequence was full-length and 
intronless. A temporal expression study showed that TpdesN was constitutively 
transcribed during algal cultivation. Expression of the TpdesN ORF in yeast 
supplemented with PUFAs as potential substrates for desaturation revealed no new 
products. There was also no evidence of activity with the endogenous 18: 1 A9 which 
excludes the possibility that TpDESN acts as a A12-desaturase. However, an increase in 
the peak area of a FAME eluting in the range of the sixteen carbon FAMEs was 
identified and GC-MS based analysis revealed this to be 16:1 All fatty acid. Small 
amounts of this FA are also present in wild type yeast. However, quantitative 
comparison of FA levels in the empty vector pYES2 and pYDESN transformants 
showed that proportions of 16:1 Al 1 increased in the presence of TpdesN in both unfed 
cells and cells that had been fed different saturated FA. No other changes in either peak 
area or new peaks were detected in pYDESN transformants, indicating that TpDESN is 
specifically involved in conversion of 16:0 to 16:1 Al 1. 

[0073] The presence of small amounts of 16:1 All have previously been 

reported in many microalgae, including T pseudonana. However, a function for this FA 
in algal cells has not been established. The low quantity observed in many marine 
microalgae suggests that it may act as an intermediate in an as yet unidentified 
biosynthetic pathway. In insect cells, 16:1 All represents an important precursor for 
pheromone synthesis, where it is produced by an acyl-CoA Al 1 -desaturase. 
Interestingly, the insect A 1 1 -desaturases do not possess a cytochrome b5 domain in their 
N-terminal region. This represents a major primary structure difference compared with 
TpDESN. The cytochrome h5 domain is not a determinant of the substrate specificity 
[28]. Alignment of the desaturase domain of TpDESN with the full sequence of insect 
All -desaturases showed an identity of 20% (data not shown). In insect cells, All- 
desaturases are more or less specific depending on the origin of the sequence and well- 
documented reviews exist on this subject [14,15]. 
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[0074] In conclusion therefore, although the TpDESN primary sequence is very 

similar to front-end desaturases, it should not be considered a member of this family of 
desaturases because it acts only on 16:0. Identification of such a novel enzyme expands 
the functional repertoire of the membrane-bound desaturases and it should provide 
useful comparative information for understanding phylogenetic relationships between 
these enzymes. One question that remains to be answered regards whether cytochrome 
b5 was independently fused to desaturases that had already acquired their different 
specificities, or whether an ancestral fusion protein for proximal lipid modification 
duplicated and subsequently evolved into different desaturases. Studies of the primary 
structure of the different PUFA desaturases support the fact that enzyme conversion (i.e. 
change of specificity) can be achieved through a relatively few structural changes [29]. 
The high degree of homology between the many potential front-end desaturases 
identified in the genome of T. pseudonana support this notion. Given the FA profile of 
T. pseudonana cells and the complexity of the desaturase gene family it is likely that 
different genes will encode A4, A5 and A6 desaturases. It will now be very interesting to 
functionally characterise these remaining putative desaturase genes and study the 
relationship between regioselectivity, primary amino acid sequence and phylogenetic 
relationship. A crystal structure for these enzymes is still not available due to technical 
difficulties in obtaining sufficient quantities of purified membrane-bound protein. 
Molecular genetic approaches involving site-directed mutagenesis have provided new 
insight into structure-function relationships, including for example that residues in close 
proximity to the histidine motifs have been found to be involved in shifting the ratio of 
desaturation/hydroxylation activities [30]. Detailed comparative analyses and computer 
modeling of these diverse desaturases from T. pseudonana may further guide site- 
directed mutagenesis studies aimed at defining key residues controlling substrate 
specificity and regioselectivity of the introduced double bond. 
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